SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Davis, Roger J.
Flavell, Richard A.
Rakic, Pasko
Whitmarsh, Alan
Kuan, Chia-Yi
Yang, Di

(ii) TITLE OF INVENTION: JNK3 MODULATORS AND METHODS OF USE

(iii) NUMBER OF SEQUENCES: 24

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Fish & Richardson P.C.

(B) STREET: 225 Franklin Street

(C) CITY: Boston

(D) STATE: MA

(E) COUNTRY: USA

(F) ZIP: 02110-2804

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Diskette

(B) COMPUTER: IBM Compatible

(C) OPERATING SYSTEM: Windows 95

(D) SOFTWARE: FastSEQ for Windows Version 2.0b

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: 09/165,522

(B) FILING DATE: 02-OCT-1998

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: 60/060,995

(B) FILING DATE: 03-OCT-1997

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Fasse, J. Peter

(B) REGISTRATION NUMBER: 32,983

(C) REFERENCE/DOCKET NUMBER: 10363/005001

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: 617/542-5070

(B) TELEFAX: 617/542-8906

(C) TELEX: 200154

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1505 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 68...1459

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

TTATGCAAGA AACTGTTGAA TTAGACCCGT TTCCTATAGA TGAGAAACCA TACAAGCTGT 60 
GGTATTT ATG AGC CTC CAT TTC TTA TAC TAC TGC AGT GAA CCA ACA TTG 109
Met Ser Leu His Phe Leu Tyr Tyr Cys Ser Glu Pro Thr Leu
1 5 10

GAT GTG AAA ATT GCC TTT TGT CAG GGA TTC GAT AAA CAA GTG GAT GTG 157
Asp Val Lys Ile Ala Phe Cys Gln Gly Phe Asp Lys Gln Val Asp Val
15 20 25 30

TCA TAT ATT GCC AAA CAT TAC AAC ATG AGC AAA AGC AAA GTT GAC AAC 205
Ser Tyr Ile Ala Lys His Tyr Asn Met Ser Lys Ser Lys Val Asp Asn
35 40 45

CAG TTC TAC AGT GTG GAA GTG GGA GAC TCA ACC TTC ACA GTT CTC AAG 253
Gln Phe Tyr Ser Val Glu Val Gly Asp Ser Thr Phe Thr Val Leu Lys
50 55 60

CGC TAC CAG AAT CTA AAG CCT ATT GGC TCT GGG GCT CAG GGC ATA GTT 301
Arg Tyr Gln Asn Leu Lys Pro Ile Gly Ser Gly Ala Gln Gly Ile Val
65 70 75

TGT GCC GCG TAT GAT GCT GTC CTT GAC AGA AAT GTG GCC ATT AAG AAG 349
Cys Ala Ala Tyr Asp Ala Val Leu Asp Arg Asn Val Ala Ile Lys Lys
80 85 90

CTC AGC AGA CCC TTT CAG AAC CAA ACA CAT GCC AAG AGA GCG TAC CGG 397
Leu Ser Arg Pro Phe Gln Asn Gln Thr His Ala Lys Arg Ala Tyr Arg
95 100 105 110

GAG CTG GTC CTC ATG AAG TGT GTG AAC CAT AAA AAC ATT ATT AGT TTA 445
Glu Leu Val Leu Met Lys Cys Val Asn His Lys Asn Ile Ile Ser Leu
115 120 125

TTA AAT GTC TTC ACA CCC CAG AAA ACG CTG GAG GAG TTC CAA GAT GTT 493
Leu Asn Val Phe Thr Pro Gln Lys Thr Leu Glu Glu Phe Gln Asp Val
130 135 140

TAC TTA GTA ATG GAA CTG ATG GAT GCC AAC TTA TGT CAA GTG ATT CAG 541
Tyr Leu Val Met Glu Leu Met Asp Ala Asn Leu Cys Gln Val Ile Gln
145 150 155

ATG GAA TTA GAC CAT GAG CGA ATG TCT TAC CTG CTG TAC CAA ATG TTG 589
Met Glu Leu Asp His Glu Arg Met Ser Tyr Leu Leu Tyr Gln Met Leu
160 165 170

TGT GGC ATT AAG CAC CTC CAT TCT GCT GGA ATT ATT CAC AGG GAT TTA 637
Cys Gly Ile Lys His Leu His Ser Ala Gly Ile Ile His Arg Asp Leu
175 180 185 190

AAA CCA AGT AAC ATT GTA GTC AAG TCT GAT TGC ACA TTG AAA ATC CTG 685
Lys Pro Ser Asn Ile Val Val Lys Ser Asp Cys Thr Leu Lys Ile Leu
195 200 205

GAC TTT GGA CTG GCC AGG ACA GCA GGC ACA AGC TTC ATG ATG ACT CCA 733
Asp Phe Gly Leu Ala Arg Thr Ala Gly Thr Ser Phe Met Met Thr Pro
210 215 220

TAT GTG GTG ACA CGT TAT TAC AGA GCC CCT GAG GTC ATC CTG GGG ATG 781
Tyr Val Val Thr Arg Tyr Tyr Arg Ala Pro Glu Val Ile Leu Gly Met
225 230 235

GGC TAC AAG GAG AAC GTG GAT ATA TGG TCT GTG GGA TGC ATT ATG GGA 829
Gly Tyr Lys Glu Asn Val Asp Ile Trp Ser Val Gly Cys Ile Met Gly
240 245 250

GAA ATG GTT CGC CAC AAA ATC CTC TTT CCA GGA AGG GAC TAT ATT GAC 877
Glu Met Val Arg His Lys Ile Leu Phe Pro Gly Arg Asp Tyr Ile Asp
255 260 265 270

CAG TGG AAT AAG GTA ATT GAA CAA CTA GGA ACA CCA TGT CCA GAA TTC 925
Gln Trp Asn Lys Val Ile Glu Gln Leu Gly Thr Pro Cys Pro Glu Phe
275 280 285

ATG AAG AAA TTG CAA CCC ACA GTA AGA AAC TAT GTG GAG AAT CGG CCC 973
Met Lys Lys Leu Gln Pro Thr Val Arg Asn Tyr Val Glu Asn Arg Pro
290 295 300

AAG TAT GCG GGA CTC ACC TTC CCC AAA CTC TTC CCA GAT TCC CTC TTC 1021
Lys Tyr Ala Gly Leu Thr Phe Pro Lys Leu Phe Pro Asp Ser Leu Phe
305 310 315

CCA GCG GAC TCC GAG CAC AAT AAA CTC AAA GCC AGC CAA GCC AGG GAC 1069
Pro Ala Asp Ser Glu His Asn Lys Leu Lys Ala Ser Gln Ala Arg Asp
320 325 330

TTG TTG TCA AAG ATG CTA GTG ATT GAC CCA GCA AAA AGA ATA TCA GTG 1117
Leu Leu Ser Lys Met Leu Val Ile Asp Pro Ala Lys Arg Ile Ser Val
335 340 345 350

GAC GAC GCC TTA CAG CAT CCC TAC ATC AAC GTC TGG TAT GAC CCA GCC 1165
Asp Asp Ala Leu Gln His Pro Tyr Ile Asn Val Trp Tyr Asp Pro Ala
355 360 365

GAA GTG GAG GCG CCT CCA CCT CAG ATA TAT GAC AAG CAG TTG GAT GAA 1213
Glu Val Glu Ala Pro Pro Pro Gln Ile Tyr Asp Lys Gln Leu Asp Glu
370 375 380

AGA GAA CAC ACA ATT GAA GAA TGG AAA GAA CTT ATC TAC AAG GAA GTA 1261
Arg Glu His Thr Ile Glu Glu Trp Lys Glu Leu Ile Tyr Lys Glu Val
385 390 395

ATG AAT TCA GAA GAA AAG ACT AAA AAT GGT GTA GTA AAA GGA CAG CCT 1309
Met Asn Ser Glu Glu Lys Thr Lys Asn Gly Val Val Lys Gly Gln Pro
400 405 410

TCT CCT TCA GGT GCA GCA GTG AAC AGC AGT GAG AGT CTC CCT CCA TCC 1357
Ser Pro Ser Gly Ala Ala Val Asn Ser Ser Glu Ser Leu Pro Pro Ser
415 420 425 430

TCG TCT GTC AAT GAC ATC TCC TCC ATG TCC ACC GAC CAG ACC CTG GCA 1405
Ser Ser Val Asn Asp Ile Ser Ser Met Ser Thr Asp Gln Thr Leu Ala
435 440 445

TCT GAC ACT GAC AGC AGC CTG GAA GCC TCG GCA GGA CCC CTG GGT TGT 1453
Ser Asp Thr Asp Ser Ser Leu Glu Ala Ser Ala Gly Pro Leu Gly Cys
450 455 460

TGC AGG TGACTAGCCG CCTGCCTGCG AAACCCAGCG TTCTTCAGGA GATGAT 1505
Cys Arg

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 464 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Ser Leu His Phe Leu Tyr Tyr Cys Ser Glu Pro Thr Leu Asp Val
1 5 10 15

Lys Ile Ala Phe Cys Gln Gly Phe Asp Lys Gln Val Asp Val Ser Tyr
20 25 30

Ile Ala Lys His Tyr Asn Met Ser Lys Ser Lys Val Asp Asn Gln Phe
35 40 45

Tyr Ser Val Glu Val Gly Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr
50 55 60

Gln Asn Leu Lys Pro Ile Gly Ser Gly Ala Gln Gly Ile Val Cys Ala
65 70 75 80

Ala Tyr Asp Ala Val Leu Asp Arg Asn Val Ala Ile Lys Lys Leu Ser
85 90 95

Arg Pro Phe Gln Asn Gln Thr His Ala Lys Arg Ala Tyr Arg Glu Leu
100 105 110

Val Leu Met Lys Cys Val Asn His Lys Asn Ile Ile Ser Leu Leu Asn
115 120 125

Val Phe Thr Pro Gln Lys Thr Leu Glu Glu Phe Gln Asp Val Tyr Leu
130 135 140

Val Met Glu Leu Met Asp Ala Asn Leu Cys Gln Val Ile Gln Met Glu
145 150 155 160

Leu Asp His Glu Arg Met Ser Tyr Leu Leu Tyr Gln Met Leu Cys Gly
165 170 175

Ile Lys His Leu His Ser Ala Gly Ile Ile His Arg Asp Leu Lys Pro
180 185 190

Ser Asn Ile Val Val Lys Ser Asp Cys Thr Leu Lys Ile Leu Asp Phe
195 200 205

Gly Leu Ala Arg Thr Ala Gly Thr Ser Phe Met Met Thr Pro Tyr Val
210 215 220

Val Thr Arg Tyr Tyr Arg Ala Pro Glu Val Ile Leu Gly Met Gly Tyr
225 230 235 240

Lys Glu Asn Val Asp Ile Trp Ser Val Gly Cys Ile Met Gly Glu Met
245 250 255

Val Arg His Lys Ile Leu Phe Pro Gly Arg Asp Tyr Ile Asp Gln Trp
260 265 270

Asn Lys Val Ile Glu Gln Leu Gly Thr Pro Cys Pro Glu Phe Met Lys
275 280 285

Lys Leu Gln Pro Thr Val Arg Asn Tyr Val Glu Asn Arg Pro Lys Tyr
290 295 300

Ala Gly Leu Thr Phe Pro Lys Leu Phe Pro Asp Ser Leu Phe Pro Ala
305 310 315 320

Asp Ser Glu His Asn Lys Leu Lys Ala Ser Gln Ala Arg Asp Leu Leu
325 330 335

Ser Lys Met Leu Val Ile Asp Pro Ala Lys Arg Ile Ser Val Asp Asp
340 345 350

Ala Leu Gln His Pro Tyr Ile Asn Val Trp Tyr Asp Pro Ala Glu Val
355 360 365

Glu Ala Pro Pro Pro Gln Ile Tyr Asp Lys Gln Leu Asp Glu Arg Glu
370 375 380

His Thr Ile Glu Glu Trp Lys Glu Leu Ile Tyr Lys Glu Val Met Asn
385 390 395 400

Ser Glu Glu Lys Thr Lys Asn Gly Val Val Lys Gly Gln Pro Ser Pro
405 410 415

Ser Gly Ala Ala Val Asn Ser Ser Glu Ser Leu Pro Pro Ser Ser Ser
420 425 430

Val Asn Asp Ile Ser Ser Met Ser Thr Asp Gln Thr Leu Ala Ser Asp
435 440 445

Thr Asp Ser Ser Leu Glu Ala Ser Ala Gly Pro Leu Gly Cys Cys Arg
450 455 460

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2367 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GAGAAATGGC GTGGCAGGGG ACCCAGCGAG CCCAGAGGGA TTTTGCCGCT GCTTCCTCTA 60 

CCCCTGTATT TCACGCAGCT CTCTAAATTG ACTCAGCTCC AGGCTAGTGT GAGAAACACC 120 

AACAGCAGGC CCATCTCAGA TCTTCACTAT GGCAACTTAT GCAAGAAACT GTTGAATTAG 180

ACCCGTTTCC TATAGATGAG AAACCATACA AGCTGTGGTA TTTATGAGCC TCCATTTCTT 240 

ATACTACTGC AGTGAACCAA CATTGGATGT GAAAATTGCC TTTTGTCAGG GATTCGATAA 300 

ACAAGTGGAT GTGTCATATA TTGCCAAACA TTACAACATG AGCAAAAGCA AAGTTGACAA 360 

CCAGTTCTAC AGTGTGGAAG TGGGAGACTC AACCTTCACA GTTCTCAAGC GCTACCAGAA 420 

TCTAAAGCCT ATTGGCTCTG GGGCTCAGGG CATAGTTTGT GCCGCGTATG ATGCTGTCCT 480 

TGACAGAAAT GTGGCCATTA AGAAGCTCAG CAGACCCTTT CAGAACCAAA CACATGCCAA 540 

GAGAGCGTAC CGGGAGCTGG TCCTCATGAA GTGTGTGAAC CATAAAAACA TTATTAGTTT 600 

ATTAAATGTC TTCACACCCC AGAAAACGCT GGAGGAGTTC CAAGATGTTT ACTTAGTAAT 660 

GGAACTGATG GATGCCAACT TATGTCAAGT GATTCAGATG GAATTAGACC ATGAGCGAAT 720 

GTCTTACCTG CTGTACCAAA TGTTGTGTGG CATTAAGCAC CTCCATTCTG CTGGAATTAT 780 

TCACAGGGAT TTAAAACCAA GTAACATTGT AGTCAAGTCT GATTGCACAT TGAAAATCCT 840 

GGACTTTGGA CTGGCCAGGA CAGCAGGCAC AAGCTTCATG ATGACTCCAT ATGTGGTGAC 900 

ACGTTATTAC AGAGCCCCTG AGGTCATCCT GGGGATGGGC TACAAGGAGA ACGTGGATAT 960 

ATGGTCTGTG GGATGCATTA TGGGAGAAAT GGTTCGCCAC AAAATCCTCT TTCCAGGAAG 1020 

GGACTATATT GACCAGTGGA ATAAGGTAAT TGAACAACTA GGAACACCAT GTCCAGAATT 1080 

CATGAAGAAA TTGCAACCCA CAGTAAGAAA CTATGTGGAG AATCGGCCCA AGTATGCGGG 1140 

ACTCACCTTC CCCAAACTCT TCCCAGATTC CCTCTTCCCA GCGGACTCCG AGCACAATAA 1200 

ACTCAAAGCC AGCCAAGCCA GGGACTTGTT GTCAAAGATG CTAGTGATTG ACCCAGCAAA 1260 

AAGAATATCA GTGGACGACG CCTTACAGCA TCCCTACATC AACGTCTGGT ATGACCCAGC 1320 

CGAAGTGGAG GCGCCTCCAC CTCAGATATA TGACAAGCAG TTGGATGAAA GAGAACACAC 1380 

AATTGAAGAA TGGAAAGAAC TTATCTACAA GGAAGTAATG AATTCAGAAG AAAAGACTAA 1440 

AAATGGTGTA GTAAAAGGAC AGCCTTCTCC TTCAGGTGCA GCAGTGAACA GCAGTGAGAG 1500 

TCTCCCTCCA TCCTCGTCTG TCAATGACAT CTCCTCCATG TCCACCGACC AGACCCTGGC 1560 

ATCTGACACT GACAGCAGCC TGGAAGCCTC GGCAGGACCC CTGGGTTGTT GCAGGTGACT 1620 

AGCCGCCTGC CTGCGAAACC CAGCGTTCTT CAGGAGATGA TGTGATGGAA CACACACACA 1680 

CGCAGACACA CACACACACA CAAATGCAGA CACACAACAT CAAGAAAACA GCAAGGGAGA 1740 

GAATCCAAGC CTAAAATTAA ATAAATCTTT CAGCCTGCTT CTTCCCCAGG GTTCTGTATT 1800 

GCAGCTAAGC TCAAATGTAT ATTTAACTTC TAGTTGCTCT TGCTTTGGTC TTCTTCCAAT 1860 

GATGCTTACT ACAGAAAGCA AATCAGACAC AATTAGAGAA GCCTTTTCCA TAAAGTGTAA 1920 

TTTTAATGGC TGCAAAACCG GCAACCTGTA ACTGCCCTTT TAAATGGCAT GACAAGGTGT 1980 

GCAGTGGCCC CATCCAGCAT GTGTGTGTCT CTATCTTGCA TCTACCTGCT CCTTGGCCTA 2040 

GTCAGATGGA TGTAGATACA GATCCGCATG TGTCTGTATT CATACAGCAC TACTTACTTA 2100 

GAGATGCTAC TCTCAGTGTC CTCAGGGCTC TACCAAGACA TAATGCACTG GGGTACCACA 2160 

TGGTCCATTT CATGTGATCT ATTACTCTGA CATAAACCCA TCTGTAATAT ATTGCCAGTA 2220 

TATAAGCTGT TTAGTTTGTT AATTGATTAA ACTGTATGTC TTATAAGAAA ACATGTAAAG 2280 






GGGGAATATA TTGGGGGAGT GAGCTCTCTC AGACCCTTGA AGATGTAGCT TCCAAATTTG 2340 
AATGGATTAA ATGGCACCTG TATACCA 2367 



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1773 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 92...1357

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

ATCTCAGATC TTCACTATGG CAACTTATGC AAGAAACTGT TGAATTAGAC CCGTTTCCTA 60 
TAGATGAGAA ACCATACAAG CTGTGGTATT T ATG AGC CTC CAT TTC TTA TAC 112
Met Ser Leu His Phe Leu Tyr
1 5

TAC TGC AGT GAA CCA ACA TTG GAT GTG AAA ATT GCC TTT TGT CAG GGA 160
Tyr Cys Ser Glu Pro Thr Leu Asp Val Lys Ile Ala Phe Cys Gln Gly
10 15 20

TTC GAT AAA CAA GTG GAT GTG TCA TAT ATT GCC AAA CAT TAC AAC ATG 208
Phe Asp Lys Gln Val Asp Val Ser Tyr Ile Ala Lys His Tyr Asn Met
25 30 35

AGC AAA AGC AAA GTT GAC AAC CAG TTC TAC AGT GTG GAA GTG GGA GAC 256
Ser Lys Ser Lys Val Asp Asn Gln Phe Tyr Ser Val Glu Val Gly Asp
40 45 50 55

TCA ACC TTC ACA GTT CTC AAG CGC TAC CAG AAT CTA AAG CCT ATT GGC 304
Ser Thr Phe Thr Val Leu Lys Arg Tyr Gln Asn Leu Lys Pro Ile Gly
60 65 70

TCT GGG GCT CAG GGC ATA GTT TGT GCC GCG TAT GAT GCT GTC CTT GAC 352
Ser Gly Ala Gln Gly Ile Val Cys Ala Ala Tyr Asp Ala Val Leu Asp
75 80 85

AGA AAT GTG GCC ATT AAG AAG CTC AGC AGA CCC TTT CAG AAC CAA ACA 400
Arg Asn Val Ala Ile Lys Lys Leu Ser Arg Pro Phe Gln Asn Gln Thr
90 95 100

CAT GCC AAG AGA GCG TAC CGG GAG CTG GTC CTC ATG AAG TGT GTG AAC 448
His Ala Lys Arg Ala Tyr Arg Glu Leu Val Leu Met Lys Cys Val Asn
105 110 115

CAT AAA AAC ATT ATT AGT TTA TTA AAT GTC TTC ACA CCC CAG AAA ACG 496
His Lys Asn Ile Ile Ser Leu Leu Asn Val Phe Thr Pro Gln Lys Thr
120 125 130 135

CTG GAG GAG TTC CAA GAT GTT TAC TTA GTA ATG GAA CTG ATG GAT GCC 544
Leu Glu Glu Phe Gln Asp Val Tyr Leu Val Met Glu Leu Met Asp Ala
140 145 150

AAC TTA TGT CAA GTG ATT CAG ATG GAA TTA GAC CAT GAG CGA ATG TCT 592
Asn Leu Cys Gln Val Ile Gln Met Glu Leu Asp His Glu Arg Met Ser
155 160 165

TAC CTG CTG TAC CAA ATG TTG TGT GGC ATT AAG CAC CTC CAT TCT GCT 640
Tyr Leu Leu Tyr Gln Met Leu Cys Gly Ile Lys His Leu His Ser Ala
170 175 180

GGA ATT ATT CAC AGG GAT TTA AAA CCA AGT AAC ATT GTA GTC AAG TCT 688
Gly Ile Ile His Arg Asp Leu Lys Pro Ser Asn Ile Val Val Lys Ser
185 190 195

GAT TGC ACA TTG AAA ATC CTG GAC TTT GGA CTG GCC AGG ACA GCA GGC 736
Asp Cys Thr Leu Lys Ile Leu Asp Phe Gly Leu Ala Arg Thr Ala Gly
200 205 210 215

ACA AGC TTC ATG ATG ACT CCA TAT GTG GTG ACA CGT TAT TAC AGA GCC 784
Thr Ser Phe Met Met Thr Pro Tyr Val Val Thr Arg Tyr Tyr Arg Ala
220 225 230

CCT GAG GTC ATC CTG GGG ATG GGC TAC AAG GAG AAC GTG GAT ATA TGG 832
Pro Glu Val Ile Leu Gly Met Gly Tyr Lys Glu Asn Val Asp Ile Trp
235 240 245

TCT GTG GGA TGC ATT ATG GGA GAA ATG GTT CGC CAC AAA ATC CTC TTT 880
Ser Val Gly Cys Ile Met Gly Glu Met Val Arg His Lys Ile Leu Phe
250 255 260

CCA GGA AGG GAC TAT ATT GAC CAG TGG AAT AAG GTA ATT GAA CAA CTA 928
Pro Gly Arg Asp Tyr Ile Asp Gln Trp Asn Lys Val Ile Glu Gln Leu
265 270 275

GGA ACA CCA TGT CCA GAA TTC ATG AAG AAA TTG CAA CCC ACA GTA AGA 976
Gly Thr Pro Cys Pro Glu Phe Met Lys Lys Leu Gln Pro Thr Val Arg
280 285 290 295

AAC TAT GTG GAG AAT CGG CCC AAG TAT GCG GGA CTC ACC TTC CCC AAA 1024
Asn Tyr Val Glu Asn Arg Pro Lys Tyr Ala Gly Leu Thr Phe Pro Lys
300 305 310

CTC TTC CCA GAT TCC CTC TTC CCA GCG GAC TCC GAG CAC AAT AAA CTC 1072
Leu Phe Pro Asp Ser Leu Phe Pro Ala Asp Ser Glu His Asn Lys Leu
315 320 325

AAA GCC AGC CAA GCC AGG GAC TTG TTG TCA AAG ATG CTA GTG ATT GAC 1120
Lys Ala Ser Gln Ala Arg Asp Leu Leu Ser Lys Met Leu Val Ile Asp
330 335 340

CCA GCA AAA AGA ATA TCA GTG GAC GAC GCC TTA CAG CAT CCC TAC ATC 1168
Pro Ala Lys Arg Ile Ser Val Asp Asp Ala Leu Gln His Pro Tyr Ile
345 350 355

AAC GTC TGG TAT GAC CCA GCC GAA GTG GAG GCG CCT CCA CCT CAG ATA 1216
Asn Val Trp Tyr Asp Pro Ala Glu Val Glu Ala Pro Pro Pro Gln Ile
360 365 370 375

TAT GAC AAG CAG TTG GAT GAA AGA GAA CAC ACA ATT GAA GAA TGG AAA 1264
Tyr Asp Lys Gln Leu Asp Glu Arg Glu His Thr Ile Glu Glu Trp Lys
380 385 390

GAA CTT ATC TAC AAG GAA GTA ATG AAT TCA GAA GAA AAG ACT AAA AAT 1312
Glu Leu Ile Tyr Lys Glu Val Met Asn Ser Glu Glu Lys Thr Lys Asn
395 400 405

GGT GTA GTA AAA GGA CAG CCT TCT CCT TCA GCA CAG GTG CAG CAG 1357
Gly Val Val Lys Gly Gln Pro Ser Pro Ser Ala Gln Val Gln Gln
410 415 420

TGAACAGCAG TGAGAGTCTC CCTCCATCCT CGTCTGTCAA TGACATCTCC TCCATGTCCA 1417 

CCGACCAGAC CCTGGCATCT GACACTGACA GCAGCCTGGA AGCCTCGGCA GGACCCCTGG 1477 

GTTGTTGCAG GTGACTAGCC GCCTGCCTGC GAAACCCAGC GTTCTTCAGG AGATGATGTG 1537 

ATGGAACACA CACACACGCA GACACACACA CACACACAAA TGCAGACACA CAACATCAAG 1597 

AAAACAGCAA GGGAGAGAAT CCAAGCCTAA AATTAAATAA ATCTTTCAGC CTGCTTCTTC 1657 

CCCAGGGTTC TGTATTGCAG CTAAGCTCAA ATGTATATTT AACTTCTAGT TGCTCTTGCT 1717 

TTGGTCTTCT TCCAATGATG CTTACTACAG AAAGCAAATC AGACACAATT AGAGAA 1773 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ser Leu His Phe Leu Tyr Tyr Cys Ser Glu Pro Thr Leu Asp Val
1 5 10 15

Lys Ile Ala Phe Cys Gln Gly Phe Asp Lys Gln Val Asp Val Ser Tyr
20 25 30

Ile Ala Lys His Tyr Asn Met Ser Lys Ser Lys Val Asp Asn Gln Phe
35 40 45

Tyr Ser Val Glu Val Gly Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr
50 55 60

Gln Asn Leu Lys Pro Ile Gly Ser Gly Ala Gln Gly Ile Val Cys Ala
65 70 75 80

Ala Tyr Asp Ala Val Leu Asp Arg Asn Val Ala Ile Lys Lys Leu Ser
85 90 95

Arg Pro Phe Gln Asn Gln Thr His Ala Lys Arg Ala Tyr Arg Glu Leu
100 105 110

Val Leu Met Lys Cys Val Asn His Lys Asn Ile Ile Ser Leu Leu Asn
115 120 125

Val Phe Thr Pro Gln Lys Thr Leu Glu Glu Phe Gln Asp Val Tyr Leu
130 135 140

Val Met Glu Leu Met Asp Ala Asn Leu Cys Gln Val Ile Gln Met Glu
145 150 155 160

Leu Asp His Glu Arg Met Ser Tyr Leu Leu Tyr Gln Met Leu Cys Gly
165 170 175

Ile Lys His Leu His Ser Ala Gly Ile Ile His Arg Asp Leu Lys Pro
180 185 190

Ser Asn Ile Val Val Lys Ser Asp Cys Thr Leu Lys Ile Leu Asp Phe
195 200 205

Gly Leu Ala Arg Thr Ala Gly Thr Ser Phe Met Met Thr Pro Tyr Val
210 215 220

Val Thr Arg Tyr Tyr Arg Ala Pro Glu Val Ile Leu Gly Met Gly Tyr
225 230 235 240

Lys Glu Asn Val Asp Ile Trp Ser Val Gly Cys Ile Met Gly Glu Met
245 250 255

Val Arg His Lys Ile Leu Phe Pro Gly Arg Asp Tyr Ile Asp Gln Trp
260 265 270

Asn Lys Val Ile Glu Gln Leu Gly Thr Pro Cys Pro Glu Phe Met Lys
275 280 285

Lys Leu Gln Pro Thr Val Arg Asn Tyr Val Glu Asn Arg Pro Lys Tyr
290 295 300

Ala Gly Leu Thr Phe Pro Lys Leu Phe Pro Asp Ser Leu Phe Pro Ala
305 310 315 320

Asp Ser Glu His Asn Lys Leu Lys Ala Ser Gln Ala Arg Asp Leu Leu
325 330 335

Ser Lys Met Leu Val Ile Asp Pro Ala Lys Arg Ile Ser Val Asp Asp
340 345 350

Ala Leu Gln His Pro Tyr Ile Asn Val Trp Tyr Asp Pro Ala Glu Val
355 360 365

Glu Ala Pro Pro Pro Gln Ile Tyr Asp Lys Gln Leu Asp Glu Arg Glu
370 375 380

His Thr Ile Glu Glu Trp Lys Glu Leu Ile Tyr Lys Glu Val Met Asn
385 390 395 400

Ser Glu Glu Lys Thr Lys Asn Gly Val Val Lys Gly Gln Pro Ser Pro
405 410 415

Ser Ala Gln Val Gln Gln
420

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GAGAAATGGC GTGGCAGGGG ACCCAGCGAG CCCAGAGGGA TTTTGCCGCT GCTTCCTCTA 60 

CCCCTGTATT TCACGCAGCT CTCTAAATTG ACTCAGCTCC AGGCTAGTGT GAGAAACACC 120 

AACAGCAGGC CCATCTCAGA TCTTCACTAT GGCAACTTAT GCAAGAAACT GTTGAATTAG 180 

ACCCGTTTCC TATAGATGAG AAACCATACA AGCTGTGGTA TTTATGAGCC TCCATTTCTT 240 

ATACTACTGC AGTGAACCAA CATTGGATGT GAAAATTGCC TTTTGTCAGG GATTCGATAA 300 

ACAAGTGGAT GTGTCATATA TTGCCAAACA TTACAACATG AGCAAAAGCA AAGTTGACAA 360 

CCAGTTCTAC AGTGTGGAAG TGGGAGACTC AACCTTCACA GTTCTCAAGC GCTACCAGAA 420 

TCTAAAGCCT ATTGGCTCTG GGGCTCAGGG CATAGTTTGT GCCGCGTATG ATGCTGTCCT 480 

TGACAGAAAT GTGGCCATTA AGAAGCTCAG CAGACCCTTT CAGAACCAAA CACATGCCAA 540 

GAGAGCGTAC CGGGAGCTGG TCCTCATGAA GTGTGTGAAC CATAAAAACA TTATTAGTTT 600 

ATTAAATGTC TTCACACCCC AGAAAACGCT GGAGGAGTTC CAAGATGTTT ACTTAGTAAT 660 

GGAACTGATG GATGCCAACT TATGTCAAGT GATTCAGATG GAATTAGACC ATGAGCGAAT 720 

GTCTTACCTG CTGTACCAAA TGTTGTGTGG CATTAAGCAC CTCCATTCTG CTGGAATTAT 780 

TCACAGGGAT TTAAAACCAA GTAACATTGT AGTCAAGTCT GATTGCACAT TGAAAATCCT 840 

GGACTTTGGA CTGGCCAGGA CAGCAGGCAC AAGCTTCATG ATGACTCCAT ATGTGGTGAC 900 

ACGTTATTAC AGAGCCCCTG AGGTCATCCT GGGGATGGGC TACAAGGAGA ACGTGGATAT 960 

ATGGTCTGTG GGATGCATTA TGGGAGAAAT GGTTCGCCAC AAAATCCTCT TTCCAGGAAG 1020 

GGACTATATT GACCAGTGGA ATAAGGTAAT TGAACAACTA GGAACACCAT GTCCAGAATT 1080 

CATGAAGAAA TTGCAACCCA CAGTAAGAAA CTATGTGGAG AATCGGCCCA AGTATGCGGG 1140 

ACTCACCTTC CCCAAACTCT TCCCAGATTC CCTCTTCCCA GCGGACTCCG AGCACAATAA 1200 

ACTCAAAGCC AGCCAAGCCA GGGACTTGTT GTCAAAGATG CTAGTGATTG ACCCAGCAAA 1260 

AAGAATATCA GTGGACGACG CCTTACAGCA TCCCTACATC AACGTCTGGT ATGACCCAGC 1320 

CGAAGTGGAG GCGCCTCCAC CTCAGATATA TGACAAGCAG TTGGATGAAA GAGAACACAC 1380 

AATTGAAGAA TGGAAAGAAC TTATCTACAA GGAAGTAATG AATTCAGAAG AAAAGACTAA 1440 

AAATGGTGTA GTAAAAGGAC AGCCTTCTCC TTCAGCACAG GTGCAGCAGT GAACAGCAGT 1500 

GAGAGTCTCC CTCCATCCTC GTCTGTCAAT GACATCTCCT CCATGTCCAC CGACCAGACC 1560 

CTGGCATCTG ACACTGACAG CAGCCTGGAA GCCTCGGCAG GACCCCTGGG TTGTTGCAGG 1620 
TGACTAGCCG CCTGCCTGCG AAACCCAGCG TTCTTCAGGA GATGATGTGA TGGAACACAC 1680

ACACACGCAG ACACACACAC ACACACAAAT GCAGACACAC AACATCAAGA AAACAGCAAG 1740

GGAGAGAATC CAAGCCTAAA ATTAAATAAA TCTTTCAGCC TGCTTCTTCC CCAGGGTTCT 1800 

GTATTGCAGC TAAGCTCAAA TGTATATTTA ACTTCTAGTT GCTCTTGCTT TGGTCTTCTT 1860 

CCAATGATGC TTACTACAGA AAGCAAATCA GACACAATTA GAGAAGCCTT TTCCATAAAG 1920 

TGTAATTTTA ATGGCTGCAA AACCGGCAAC CTGTAACTGC CCTTTTAAAT GGCATGACAA 1980 

GGTGTGCAGT GGCCCCATCC AGCATGTGTG TGTCTCTATC TTGCATCTAC CTGCTCCTTG 2040 

GCCTAGTCAG ATGGATGTAG ATACAGATCC GCATGTGTCT GTATTCATAC AGCACTACTT 2100 

ACTTAGAGAT GCTACTCTCA GTGTCCTCAG GGCTCTACCA AGACATAATG CACTGGGGTA 2160

CCACATGGTC CATTTCATGT GATCTATTAC TCTGACATAA ACCCATCTGT AATATATTGC 2220 

CAGTATATAA GCTGTTTAGT TTGTTAATTG ATTAAACTGT ATGTCTTATA AGAAAACATG 2280 

TAAAGGGGGA ATATATTGGG GGAGTGAGCT CTCTCAGACC CTTGAAGATG TAGCTTCCAA 2340 

ATTTGAATGG ATTAAATGGC ACCTGTATAC CA 2372 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 224...1489

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GAGAAATGGC GTGGCAGGGG ACCCAGCGAG CCCAGAGGGA TTTTGCCGCT GCTTCCTCTA 60 
CCCCTGTATT TCACGCAGCT CTCTAAATTG ACTCAGCTCC AGGCTAGTGT GAGAAACACC 120 
AACAGCAGGC CCATCTCAGA TCTTCACTAT GGCAACTTAT GCAAGAAACT GTTGAATTAG 180 
ACCCGTTTCC TATAGATGAG AAACCATACA AGCTGTGGTA TTT ATG AGC CTC CAT 235
Met Ser Leu His
1

TTC TTA TAC TAC TGC AGT GAA CCA ACA TTG GAT GTG AAA ATT GCC TTT 283
Phe Leu Tyr Tyr Cys Ser Glu Pro Thr Leu Asp Val Lys Ile Ala Phe
5 10 15 20

TGT CAG GGA TTC GAT AAA CAA GTG GAT GTG TCA TAT ATT GCC AAA CAT 331
Cys Gln Gly Phe Asp Lys Gln Val Asp Val Ser Tyr Ile Ala Lys His
25 30 35

TAC AAC ATG AGC AAA AGC AAA GTT GAC AAC CAG TTC TAC AGT GTG GAA 379
Tyr Asn Met Ser Lys Ser Lys Val Asp Asn Gln Phe Tyr Ser Val Glu
40 45 50

GTG GGA GAC TCA ACC TTC ACA GTT CTC AAG CGC TAC CAG AAT CTA AAG 427
Val Gly Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr Gln Asn Leu Lys
55 60 65

CCT ATT GGC TCT GGG GCT CAG GGC ATA GTT TGT GCC GCG TAT GAT GCT 475
Pro Ile Gly Ser Gly Ala Gln Gly Ile Val Cys Ala Ala Tyr Asp Ala
70 75 80

GTC CTT GAC AGA AAT GTG GCC ATT AAG AAG CTC AGC AGA CCC TTT CAG 523
Val Leu Asp Arg Asn Val Ala Ile Lys Lys Leu Ser Arg Pro Phe Gln
85 90 95 100

AAC CAA ACA CAT GCC AAG AGA GCG TAC CGG GAG CTG GTC CTC ATG AAG 571
Asn Gln Thr His Ala Lys Arg Ala Tyr Arg Glu Leu Val Leu Met Lys
105 110 115

TGT GTG AAC CAT AAA AAC ATT ATT AGT TTA TTA AAT GTC TTC ACA CCC 619
Cys Val Asn His Lys Asn Ile Ile Ser Leu Leu Asn Val Phe Thr Pro
120 125 130

CAG AAA ACG CTG GAG GAG TTC CAA GAT GTT TAC TTA GTA ATG GAA CTG 667
Gln Lys Thr Leu Glu Glu Phe Gln Asp Val Tyr Leu Val Met Glu Leu
135 140 145

ATG GAT GCC AAC TTA TGT CAA GTG ATT CAG ATG GAA TTA GAC CAT GAG 715
Met Asp Ala Asn Leu Cys Gln Val Ile Gln Met Glu Leu Asp His Glu
150 155 160

CGA ATG TCT TAC CTG CTG TAC CAA ATG TTG TGT GGC ATT AAG CAC CTC 763
Arg Met Ser Tyr Leu Leu Tyr Gln Met Leu Cys Gly Ile Lys His Leu
165 170 175 180

CAT TCT GCT GGA ATT ATT CAC AGG GAT TTA AAA CCA AGT AAC ATT GTA 811
His Ser Ala Gly Ile Ile His Arg Asp Leu Lys Pro Ser Asn Ile Val
185 190 195

GTC AAG TCT GAT TGC ACA TTG AAA ATC CTG GAC TTT GGA CTG GCC AGG 859
Val Lys Ser Asp Cys Thr Leu Lys Ile Leu Asp Phe Gly Leu Ala Arg
200 205 210

ACA GCA GGC ACA AGC TTC ATG ATG ACT CCA TAT GTG GTG ACA CGT TAT 907
Thr Ala Gly Thr Ser Phe Met Met Thr Pro Tyr Val Val Thr Arg Tyr
215 220 225

TAC AGA GCC CCT GAG GTC ATC CTG GGG ATG GGC TAC AAG GAG AAC GTG 955
Tyr Arg Ala Pro Glu Val Ile Leu Gly Met Gly Tyr Lys Glu Asn Val
230 235 240

GAT ATA TGG TCT GTG GGA TGC ATT ATG GGA GAA ATG GTT CGC CAC AAA 1003
Asp Ile Trp Ser Val Gly Cys Ile Met Gly Glu Met Val Arg His Lys
245 250 255 260

ATC CTC TTT CCA GGA AGG GAC TAT ATT GAC CAG TGG AAT AAG GTA ATT 1051
Ile Leu Phe Pro Gly Arg Asp Tyr Ile Asp Gln Trp Asn Lys Val Ile
265 270 275

GAA CAA CTA GGA ACA CCA TGT CCA GAA TTC ATG AAG AAA TTG CAA CCC 1099
Glu Gln Leu Gly Thr Pro Cys Pro Glu Phe Met Lys Lys Leu Gln Pro
280 285 290

ACA GTA AGA AAC TAT GTG GAG AAT CGG CCC AAG TAT GCG GGA CTC ACC 1147
Thr Val Arg Asn Tyr Val Glu Asn Arg Pro Lys Tyr Ala Gly Leu Thr
295 300 305

TTC CCC AAA CTC TTC CCA GAT TCC CTC TTC CCA GCG GAC TCC GAG CAC 1195
Phe Pro Lys Leu Phe Pro Asp Ser Leu Phe Pro Ala Asp Ser Glu His
310 315 320

AAT AAA CTC AAA GCC AGC CAA GCC AGG GAC TTG TTG TCA AAG ATG CTA 1243
Asn Lys Leu Lys Ala Ser Gln Ala Arg Asp Leu Leu Ser Lys Met Leu
325 330 335 340

GTG ATT GAC CCA GCA AAA AGA ATA TCA GTG GAC GAC GCC TTA CAG CAT 1291
Val Ile Asp Pro Ala Lys Arg Ile Ser Val Asp Asp Ala Leu Gln His
345 350 355

CCC TAC ATC AAC GTC TGG TAT GAC CCA GCC GAA GTG GAG GCG CCT CCA 1339
Pro Tyr Ile Asn Val Trp Tyr Asp Pro Ala Glu Val Glu Ala Pro Pro
360 365 370

CCT CAG ATA TAT GAC AAG CAG TTG GAT GAA AGA GAA CAC ACA ATT GAA 1387
Pro Gln Ile Tyr Asp Lys Gln Leu Asp Glu Arg Glu His Thr Ile Glu
375 380 385

GAA TGG AAA GAA CTT ATC TAC AAG GAA GTA ATG AAT TCA GAA GAA AAG 1435
Glu Trp Lys Glu Leu Ile Tyr Lys Glu Val Met Asn Ser Glu Glu Lys
390 395 400

ACT AAA AAT GGT GTA GTA AAA GGA CAG CCT TCT CCT TCA GCA CAG GTG 1483
Thr Lys Asn Gly Val Val Lys Gly Gln Pro Ser Pro Ser Ala Gln Val
405 410 415 420

CAG CAG TGAACAGCAG TGAGAGTCTC CCTCCATCCT CGTCTGTCAA TGACATCTCC 1539
Gln Gln

TCCATGTCCA CCGACCAGAC CCTGGCATCT GACACTGACA GCAGCCTGGA AGCCTCGGCA 1599 

GGACCCCTGG GTTGTTGCAG GTGACTAGCC GCCTGCCTGC GAAACCCAGC GTTCTTCAGG 1659 

AGATGATGTG ATGGAACACA CACACACGCA GACACACACA CACACACAAA TGCAGACACA 1719

CAACATCAAG AAAACAGCAA GGGAGAGAAT CCAAGCCTAA AATTAAATAA ATCTTTCAGC 1779 

CTGCTTCTTC CCCAGGGTTC TGTATTGCAG CTAAGCTCAA ATGTATATTT AACTTCTAGT 1839 

TGCTCTTGCT TTGGTCTTCT TCCAATGATG CTTACTACAG AAAGCAAATC AGACACAATT 1899 

AGAGAAGCCT TTTCCATAAA GTGTAATTTT AATGGCTGCA AAACCGGCAA CCTGTAACTG 1959 

CCCTTTTAAA TGGCATGACA AGGTGTGCAG TGGCCCCATC CAGCATGTGT GTGTCTCTAT 2019 

CTTGCATCTA CCTGCTCCTT GGCCTAGTCA GATGGATGTA GATACAGATC CGCATGTGTC 2079 

TGTATTCATA CAGCACTACT TACTTAGAGA TGCTACTCTC AGTGTCCTCA GGGCTCTACC 2139 

AAGACATAAT GCACTGGGGT ACCACATGGT CCATTTCATG TGATCTATTA CTCTGACATA 2199 

AACCCATCTG TAATATATTG CCAGTATATA AGCTGTTTAG TTTGTTAATT GATTAAACTG 2259 

TATGTCTTAT AAGAAAACAT GTAAAGGGGG AATATATTGG GGGAGTGAGC TCTCTCAGAC 2319 

CCTTGAAGAT GTAGCTTCCA AATTTGAATG GATTAAATGG CACCTGTATA CCA 2372 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Ser Leu His Phe Leu Tyr Tyr Cys Ser Glu Pro Thr Leu Asp Val
1 5 10 15

Lys Ile Ala Phe Cys Gln Gly Phe Asp Lys Gln Val Asp Val Ser Tyr
20 25 30

Ile Ala Lys His Tyr Asn Met Ser Lys Ser Lys Val Asp Asn Gln Phe
35 40 45

Tyr Ser Val Glu Val Gly Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr
50 55 60

Gln Asn Leu Lys Pro Ile Gly Ser Gly Ala Gln Gly Ile Val Cys Ala
65 70 75 80

Ala Tyr Asp Ala Val Leu Asp Arg Asn Val Ala Ile Lys Lys Leu Ser
85 90 95

Arg Pro Phe Gln Asn Gln Thr His Ala Lys Arg Ala Tyr Arg Glu Leu
100 105 110

Val Leu Met Lys Cys Val Asn His Lys Asn Ile Ile Ser Leu Leu Asn
115 120 125

Val Phe Thr Pro Gln Lys Thr Leu Glu Glu Phe Gln Asp Val Tyr Leu
130 135 140

Val Met Glu Leu Met Asp Ala Asn Leu Cys Gln Val Ile Gln Met Glu
145 150 155 160

Leu Asp His Glu Arg Met Ser Tyr Leu Leu Tyr Gln Met Leu Cys Gly
165 170 175

Ile Lys His Leu His Ser Ala Gly Ile Ile His Arg Asp Leu Lys Pro
180 185 190

Ser Asn Ile Val Val Lys Ser Asp Cys Thr Leu Lys Ile Leu Asp Phe
195 200 205

Gly Leu Ala Arg Thr Ala Gly Thr Ser Phe Met Met Thr Pro Tyr Val
210 215 220

Val Thr Arg Tyr Tyr Arg Ala Pro Glu Val Ile Leu Gly Met Gly Tyr
225 230 235 240

Lys Glu Asn Val Asp Ile Trp Ser Val Gly Cys Ile Met Gly Glu Met
245 250 255

Val Arg His Lys Ile Leu Phe Pro Gly Arg Asp Tyr Ile Asp Gln Trp
260 265 270

Asn Lys Val Ile Glu Gln Leu Gly Thr Pro Cys Pro Glu Phe Met Lys
275 280 285

Lys Leu Gln Pro Thr Val Arg Asn Tyr Val Glu Asn Arg Pro Lys Tyr
290 295 300

Ala Gly Leu Thr Phe Pro Lys Leu Phe Pro Asp Ser Leu Phe Pro Ala
305 310 315 320

Asp Ser Glu His Asn Lys Leu Lys Ala Ser Gln Ala Arg Asp Leu Leu
325 330 335

Ser Lys Met Leu Val Ile Asp Pro Ala Lys Arg Ile Ser Val Asp Asp
340 345 350

Ala Leu Gln His Pro Tyr Ile Asn Val Trp Tyr Asp Pro Ala Glu Val
355 360 365

Glu Ala Pro Pro Pro Gln Ile Tyr Asp Lys Gln Leu Asp Glu Arg Glu
370 375 380

His Thr Ile Glu Glu Trp Lys Glu Leu Ile Tyr Lys Glu Val Met Asn
385 390 395 400

Ser Glu Glu Lys Thr Lys Asn Gly Val Val Lys Gly Gln Pro Ser Pro
405 410 415

Ser Ala Gln Val Gln Gln
420

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1975 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 364...1641

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CCCTCCTTAT TCCGGTTTGG AATGTGGCTA ATGAAAGCCC AGTAGGAGGA TTTCTGGGGC 60

AAACAGGTGG ACCAGGATCC TGGTTCTCAG GCACGGAATG GCTATTGTGA GAGCGCCACC 120 

AGCAGGACCA TCGCAGATCT TGGTTATGGC TGCTCACGCA AGAGGCTGTT GATGTAGACC 180 

CCCTTTCCCG TAGATGAGAA ATCACACGAG CAGTGGTATT TATGAGCCTC CATTTCTTAT 240 

ACTACTGCAG TGAACCAACC TTGGATGTGA AAATTGCCTT TTGTCAGGTG TGTGTTCCTT 300 

ACAGGTAAAA CAAAGGGATT CGACAAACAC GTGGATGTGT CTTCTGTTGT CAAACATTAC 360 

AAC ATG AGC AAA AGC AAG GTA GAT AAC CAG TTC TAC AGT GTG GAA GTC 408
Met Ser Lys Ser Lys Val Asp Asn Gln Phe Tyr Ser Val Glu Val
1 5 10 15

GGA GAC TCA ACC TTC ACA GTT CTA AAG CGC TAC CAG AAC CTG AAG CCG 456
Gly Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr Gln Asn Leu Lys Pro
20 25 30

ATC GGC TCT GGG GCT CAG GGA ATA GTT TGT GCT GCG TAT GAC GCT GTC 504
Ile Gly Ser Gly Ala Gln Gly Ile Val Cys Ala Ala Tyr Asp Ala Val
35 40 45

CTC GAC AGA AAT GTG GCC ATT AAG AAG CTC AGC AGA CCC TTC CAG AAC 552
Leu Asp Arg Asn Val Ala Ile Lys Lys Leu Ser Arg Pro Phe Gln Asn
50 55 60

CAA ACT CAT GCC AAG AGG GCT TAC CGG GAG CTG GTC CTC ATG AAG TGT 600
Gln Thr His Ala Lys Arg Ala Tyr Arg Glu Leu Val Leu Met Lys Cys
65 70 75

GTG AAC CAT AAA AAC ATT ATT AGC TTA TTA AAT GTC TTT ACA CCC CAG 648
Val Asn His Lys Asn Ile Ile Ser Leu Leu Asn Val Phe Thr Pro Gln
80 85 90 95

AAA ACA CTG GAG GAG TTC CAA GAT GTT TAC TTA GTG ATG GAA CTG ATG 696
Lys Thr Leu Glu Glu Phe Gln Asp Val Tyr Leu Val Met Glu Leu Met
100 105 110

GAC GCC AAC TTG TGT CAG GTG ATT CAG ATG GAG CTG GAC CAC GAG CGG 744
Asp Ala Asn Leu Cys Gln Val Ile Gln Met Glu Leu Asp His Glu Arg
115 120 125

ATG TCG TAC TTG CTG TAC CAG ATG CTG TCG GCG ATC AAA CAC CTC CAC 792
Met Ser Tyr Leu Leu Tyr Gln Met Leu Ser Ala Ile Lys His Leu His
130 135 140

TCC GCT GGG ATC ATC CAC AGG GAC TTA AAA CCC AGT AAC ATC GTA GTC 840
Ser Ala Gly Ile Ile His Arg Asp Leu Lys Pro Ser Asn Ile Val Val
145 150 155

AAG TCT GAT TGC ACA CTG AAA ATC CTG GAC TTT GGA CTG GCC AGG ACA 888
Lys Ser Asp Cys Thr Leu Lys Ile Leu Asp Phe Gly Leu Ala Arg Thr
160 165 170 175

GCC GGC ACA AGC TTC ATG ATG ACT CCG TAT GTG GTG ACG AGA TAT TAC 936
Ala Gly Thr Ser Phe Met Met Thr Pro Tyr Val Val Thr Arg Tyr Tyr
180 185 190

AGA GCC CCC GAG GTC ATC CTG GGC ATG GGC TAC AAG GAG AAC GTG GAC 984
Arg Ala Pro Glu Val Ile Leu Gly Met Gly Tyr Lys Glu Asn Val Asp
195 200 205

ATA TGG TCT GTG GGC TGC ATC ATG GGA GAA ATG GTT CGT CAC AAA ATC 1032
Ile Trp Ser Val Gly Cys Ile Met Gly Glu Met Val Arg His Lys Ile
210 215 220
CTC TTT CCC GGA AGG GAC TAT ATT GAC CAG TGG AAC AAA GTC ATA GAG 1080
Leu Phe Pro Gly Arg Asp Tyr Ile Asp Gln Trp Asn Lys Val Ile Glu
225 230 235

CAG CTA GGA ACT CCG TGT CCA GAA TTC ATG AAG AAA TTG CAG CCC ACC 1128
Gln Leu Gly Thr Pro Cys Pro Glu Phe Met Lys Lys Leu Gln Pro Thr
240 245 250 255

GTC AGA AAC TAC GTG GAG AAC CGG CCC AAG TAT GCA GGC CTC ACC TTC 1176
Val Arg Asn Tyr Val Glu Asn Arg Pro Lys Tyr Ala Gly Leu Thr Phe
260 265 270

CCC AAG CTC TTT CCA GAT TCC CTC TTC CCA GCG GAT TCC GAG CAC AAT 1224
Pro Lys Leu Phe Pro Asp Ser Leu Phe Pro Ala Asp Ser Glu His Asn
275 280 285

AAA CTT AAA GCC AGC CAA GCC AGG GAC TTG TTG TCA AAG ATG TTA GTG 1272
Lys Leu Lys Ala Ser Gln Ala Arg Asp Leu Leu Ser Lys Met Leu Val
290 295 300

ATT GAC CCA GCG AAG AGG ATA TCG GTG GAT GAC GCA TTG CAG CAT CCG 1320
Ile Asp Pro Ala Lys Arg Ile Ser Val Asp Asp Ala Leu Gln His Pro
305 310 315

TAC ATC AAC GTT TGG TAC GAC CCT GCT GAA GTG GAG GCG CCT CCG CCT 1368
Tyr Ile Asn Val Trp Tyr Asp Pro Ala Glu Val Glu Ala Pro Pro Pro
320 325 330 335

CAG ATA TAT GAC AAG CAA TTG GAT GAA AGG GAG CAC ACC ATC GAA GAA 1416
Gln Ile Tyr Asp Lys Gln Leu Asp Glu Arg Glu His Thr Ile Glu Glu
340 345 350

TGG AAA GAA CTC ATC TAC AAG GAA GTA ATG AAC TCA GAA GAG AAG ACT 1464
Trp Lys Glu Leu Ile Tyr Lys Glu Val Met Asn Ser Glu Glu Lys Thr
355 360 365

AAG AAC GGC GTA GTC AAA GGC CAG CCC TCA CCT TCA GGT GCA GCA GTG 1512
Lys Asn Gly Val Val Lys Gly Gln Pro Ser Pro Ser Gly Ala Ala Val
370 375 380

AAC AGC AGT GAG AGT CTC CCT CCA TCC TCA TCT GTC AAC GAC ATC TCC 1560
Asn Ser Ser Glu Ser Leu Pro Pro Ser Ser Ser Val Asn Asp Ile Ser
385 390 395

TCC ATG TCC ACC GAC CAG ACC CTC GCA TCC GAC ACT GAC AGC AGC CTG 1608
Ser Met Ser Thr Asp Gln Thr Leu Ala Ser Asp Thr Asp Ser Ser Leu
400 405 410 415

GAA GCC TCG GCG GGA CCG CTG GGT TGT TGC AGG TGACTAGCCG CCTGCCTGCG 1661
Glu Ala Ser Ala Gly Pro Leu Gly Cys Cys Arg
420 425

AAACCCAGCG TTCTTCAGGA GATGACGCCA TGATAGAACA CAGCGCACAT GCACACACAC 1721

AGAGCTTGTA CACACACACA CACACACACA CACACGCACG CACGCACGCA CGCAAGCACG 1781

CACGCACGCA CAAATGCACT CACGCAATGT CAAGAAAAAA AAAAGTAGCG AGAGAGAGCG 1841

AGAGAGCCAA CGTAAAACTA AGTTAAATCT TTCTGCGTGC TTCTCCAGAG TTCTGTATCG 1901

CAGCTGAGCT GAAATGTATA CTTAACTTCT AGTCGCGCTC GCTCGACTTT GGTCTCCCTC 1961

CGGCAGTGCT TACT 1975

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ser Lys Ser Lys Val Asp Asn Gln Phe Tyr Ser Val Glu Val Gly
1 5 10 15

Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr Gln Asn Leu Lys Pro Ile
20 25 30

Gly Ser Gly Ala Gln Gly Ile Val Cys Ala Ala Tyr Asp Ala Val Leu
35 40 45

Asp Arg Asn Val Ala Ile Lys Lys Leu Ser Arg Pro Phe Gln Asn Gln
50 55 60

Thr His Ala Lys Arg Ala Tyr Arg Glu Leu Val Leu Met Lys Cys Val
65 70 75 80

Asn His Lys Asn Ile Ile Ser Leu Leu Asn Val Phe Thr Pro Gln Lys
85 90 95

Thr Leu Glu Glu Phe Gln Asp Val Tyr Leu Val Met Glu Leu Met Asp
100 105 110

Ala Asn Leu Cys Gln Val Ile Gln Met Glu Leu Asp His Glu Arg Met
115 120 125

Ser Tyr Leu Leu Tyr Gln Met Leu Ser Ala Ile Lys His Leu His Ser
130 135 140

Ala Gly Ile Ile His Arg Asp Leu Lys Pro Ser Asn Ile Val Val Lys
145 150 155 160

Ser Asp Cys Thr Leu Lys Ile Leu Asp Phe Gly Leu Ala Arg Thr Ala
165 170 175

Gly Thr Ser Phe Met Met Thr Pro Tyr Val Val Thr Arg Tyr Tyr Arg
180 185 190

Ala Pro Glu Val Ile Leu Gly Met Gly Tyr Lys Glu Asn Val Asp Ile
195 200 205

Trp Ser Val Gly Cys Ile Met Gly Glu Met Val Arg His Lys Ile Leu
210 215 220

Phe Pro Gly Arg Asp Tyr Ile Asp Gln Trp Asn Lys Val Ile Glu Gln
225 230 235 240

Leu Gly Thr Pro Cys Pro Glu Phe Met Lys Lys Leu Gln Pro Thr Val
245 250 255

Arg Asn Tyr Val Glu Asn Arg Pro Lys Tyr Ala Gly Leu Thr Phe Pro
260 265 270

Lys Leu Phe Pro Asp Ser Leu Phe Pro Ala Asp Ser Glu His Asn Lys
275 280 285

Leu Lys Ala Ser Gln Ala Arg Asp Leu Leu Ser Lys Met Leu Val Ile
290 295 300

Asp Pro Ala Lys Arg Ile Ser Val Asp Asp Ala Leu Gln His Pro Tyr
305 310 315 320

Ile Asn Val Trp Tyr Asp Pro Ala Glu Val Glu Ala Pro Pro Pro Gln
325 330 335

Ile Tyr Asp Lys Gln Leu Asp Glu Arg Glu His Thr Ile Glu Glu Trp
340 345 350

Lys Glu Leu Ile Tyr Lys Glu Val Met Asn Ser Glu Glu Lys Thr Lys
355 360 365

Asn Gly Val Val Lys Gly Gln Pro Ser Pro Ser Gly Ala Ala Val Asn
370 375 380

Ser Ser Glu Ser Leu Pro Pro Ser Ser Ser Val Asn Asp Ile Ser Ser
385 390 395 400

Met Ser Thr Asp Gln Thr Leu Ala Ser Asp Thr Asp Ser Ser Leu Glu
405 410 415

Ala Ser Ala Gly Pro Leu Gly Cys Cys Arg
420 425

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2522 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 310...1575

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGGGCTTGAG TGAGCTAAAG ATTGGGTCTT CTTGGAAATC ACCTGTCTGT TATTATTTTT 60 

AAACAATCGC TACACCTCCA AAGACTCTGC TCCTTACTCC GGTTTGGAAT GTGGCTAATG 120 

ACTACCCAGT AGGGAGGATT TCTGGGGCAA ACAGCCGGAC CAGGATCCTA GTTCTCAGGC 180 

ACGGAATGGC TATTGTGAGA ACAGCACCAG CAGGATCATC GCAGATCTTG GTTATGGCCA 240 

CTCAGGCAAG ACGCTGTTGA GTTAAGACCC CTTTCCCATA GATGAGAAGC CACAGAAGCA 300 

GTGGTATTT ATG AGC CTC CAT TTC TTA TAC TAC TGC AGT GAA CCA ACC TTG 351
Met Ser Leu His Phe Leu Tyr Tyr Cys Ser Glu Pro Thr Leu
1 5 10

GAT GTG AAA ATT GCC TTT TGT CAG GGA TTC GAT AAA CAC GTG GAT GTG 399
Asp Val Lys Ile Ala Phe Cys Gln Gly Phe Asp Lys His Val Asp Val
15 20 25 30

TCA TCT ATT GCC AAA CAT TAC AAC ATG AGC AAA AGC AAG GTG GAC AAC 447
Ser Ser Ile Ala Lys His Tyr Asn Met Ser Lys Ser Lys Val Asp Asn
35 40 45

CAG TTC TAC AGT GTG GAA GTG GGG GAC TCA ACC TTC ACC GTT CTT AAG 495
Gln Phe Tyr Ser Val Glu Val Gly Asp Ser Thr Phe Thr Val Leu Lys
50 55 60

CGC TAC CAG AAC CTG AAG CCA ATT GGC TCT GGG GCT CAG GGA ATA GTC 543
Arg Tyr Gln Asn Leu Lys Pro Ile Gly Ser Gly Ala Gln Gly Ile Val
65 70 75

TGT GCT GCG TAC GAC GCT GTC CTT GAC AGA AAT GTG GCC ATT AAG AAG 591
Cys Ala Ala Tyr Asp Ala Val Leu Asp Arg Asn Val Ala Ile Lys Lys
80 85 90

CTC AGC AGA CCC TTC CAG AAC CAA ACT CAC GCC AAG AGG GCT TAC CGG 639
Leu Ser Arg Pro Phe Gln Asn Gln Thr His Ala Lys Arg Ala Tyr Arg
95 100 105 110

GAG CTG GTG CTC ATG AAG TGT GTG AAC CAT AAA AAC ATT ATT AGC TTA 687
Glu Leu Val Leu Met Lys Cys Val Asn His Lys Asn Ile Ile Ser Leu
115 120 125

TTA AAT GTT TTT ACA CCC CAG AAA ACG CTG GAG GAG TTC CAA GAT GTC 735
Leu Asn Val Phe Thr Pro Gln Lys Thr Leu Glu Glu Phe Gln Asp Val
130 135 140

TAC TTA GTG ATG GAA CTG ATG GAC GCC AAC CTG TGT CAG GTG ATT CAG 783
Tyr Leu Val Met Glu Leu Met Asp Ala Asn Leu Cys Gln Val Ile Gln
145 150 155

ATG GAG CTG GAC CAC GAG CGG ATG TCT TAC TTG CTG TAC CAG ATG CTG 831
Met Glu Leu Asp His Glu Arg Met Ser Tyr Leu Leu Tyr Gln Met Leu
160 165 170

TGT GGC ATC AAG CAC CTC CAC TCC GCT GGG ATC ATC CAC AGG GAC TTA 879
Cys Gly Ile Lys His Leu His Ser Ala Gly Ile Ile His Arg Asp Leu
175 180 185 190

AAA CCC AGT AAC ATT GTA GTC AAG TCT GAT TGC ACA CTG AAA ATC CTC 927
Lys Pro Ser Asn Ile Val Val Lys Ser Asp Cys Thr Leu Lys Ile Leu
195 200 205

GAC TTC GGA CTG GCC AGG ACA GCG GGT ACA AGC TTC ATG ATG ACT CCG 975
Asp Phe Gly Leu Ala Arg Thr Ala Gly Thr Ser Phe Met Met Thr Pro
210 215 220

TAT GTG GTG ACG CGA TAT TAC AGA GCC CCT GAG GTC ATC CTG GGC ATG 1023
Tyr Val Val Thr Arg Tyr Tyr Arg Ala Pro Glu Val Ile Leu Gly Met
225 230 235

GGC TAC AAG GAG AAC GTG GAC ATA TGG TCT GTG GGA TGC ATC ATG GGA 1071
Gly Tyr Lys Glu Asn Val Asp Ile Trp Ser Val Gly Cys Ile Met Gly
240 245 250

GAA ATG GTT CGC CAC AAA ATC CTC TTT CCC GGA AGG AGC TAT ATT GAC 1119
Glu Met Val Arg His Lys Ile Leu Phe Pro Gly Arg Ser Tyr Ile Asp
255 260 265 270

CAG TGG AAC AAA GTC ATC GAG CAG CTA GGA ACT CCG TGT CCA GAG TTC 1167
Gln Trp Asn Lys Val Ile Glu Gln Leu Gly Thr Pro Cys Pro Glu Phe
275 280 285

ATG AAG AAA TTG CAG CCC ACA GTC AGA AAC TAC GTG GAG AAT CGG CCC 1215
Met Lys Lys Leu Gln Pro Thr Val Arg Asn Tyr Val Glu Asn Arg Pro
290 295 300

AAG TAC GCA GGA CTC ACC TTC CCC AAG CTC TTT CCA GAT TCC CTC TTC 1263
Lys Tyr Ala Gly Leu Thr Phe Pro Lys Leu Phe Pro Asp Ser Leu Phe
305 310 315

CCA GCG GAT TCT GAG CAC AAT AAA CTT AAA GCC AGC CAA GCC AGG GAT 1311
Pro Ala Asp Ser Glu His Asn Lys Leu Lys Ala Ser Gln Ala Arg Asp
320 325 330

TTG TTG TCT AAG ATG TTA GTG ATT GAC CCA GTG AAG AGG ATA TCG GTG 1359
Leu Leu Ser Lys Met Leu Val Ile Asp Pro Val Lys Arg Ile Ser Val
335 340 345 350

GAC GAC GCA CTG CAG CAT CCG TAC ATC AAC GTT TGG TAC GAC CCG GCT 1407
Asp Asp Ala Leu Gln His Pro Tyr Ile Asn Val Trp Tyr Asp Pro Ala
355 360 365

GAA GTG GAG GCG CCT CCG CCT CAG ATA TAT GAT AAG CAG CTG GAT GAA 1455
Glu Val Glu Ala Pro Pro Pro Gln Ile Tyr Asp Lys Gln Leu Asp Glu
370 375 380

AGG GAG CAC ACC ATC GAA GAA TGG AAA GAA CTT ATC TAC AAG GAG GTA 1503
Arg Glu His Thr Ile Glu Glu Trp Lys Glu Leu Ile Tyr Lys Glu Val
385 390 395

ATG AAC TCA GAA GAG AAG ACT AAG AAT GGC GTA GTC AAA AGC CAG CCC 1551
Met Asn Ser Glu Glu Lys Thr Lys Asn Gly Val Val Lys Ser Gln Pro
400 405 410

TCG CCT TCA GCA CAG GTG CAG CAG TGAACAGCAG TGAGAGTCTC CCTCCATCCT 1605
Ser Pro Ser Ala Gln Val Gln Gln
415 420

CGGCTGTCAA CGACATCTCC TCCATGTCCA CCGACCAGAC CCTCGCATCT GACACTGACA 1665 

GCAGCCTGGA GGCCTCGGCG GGACCGTTGG GTTGTTGCAG GTGACTAGCC GCCTGCCTGC 1725 

GAAACCCAGC GTTCTTCAGG AGATGACGCG ATAGAACACA GCACACATGC ACACACACAG 1785 

CTTGCTCTCA CACACACTCA GCTTGCTCAC ACACACACAC ACACATACAC ACAAACACAC 1845 

ACTGTCTCTC TCTCACACAC ACACACTGTC ACAACGCACT CACGAAAGGT CAAGAAAAAA 1905 

ATAACAATAG AGAGATCCAA CATAAAATTA AGTTAAATTT TTCTGCGTGC TTCTCCAAAG 1965 

TTCTGTATCA CAGCTGAGCT GAAATGTATA CTTAACTTCT AGTTGCGCTC GCTTTGGTTT 2025 

CCCTCCAGCA GTGCTTACTA CACAAGACAA ATCAGACACA ATTAGAGAAA CCTTTCCCTA 2085 

AAGTGTAACT TAAGTGGCTG CAGAACCAGC AACCTGTAAC TGCCCTTCAA ATGGCATGAG 2145 

GAGGTGGGCA CGGGTCCCGC CCAGCATGTG TGTGTCTCTA TCTCGCGTCT ACCTGCTCTT 2205 

CCGGCCTAGT CAGATGGATG TAGATACAGA TCCCGCATGT GTCTGTATTC AAACAGCACT 2265 

TAGAGATGCT CCTGTCAGTG TCCTCCAGGC TCCACCAAGA CACACACCGG GGTACCACAT 2325 

GGTCCATTTC ATGTGATCTA TTACTCTGAC ATAAATCCAT CTGTAATATA TTGCCAGTAT 2385 

ATAAGCTGTT TAGTTTGTTA ATTGCTTAAG CTGTATGTCT TATAAGAGAC TATGTAAAGG 2445 

GGGAAAATGG AGGCGTGAAC TCTCAGACCC TTGAAGATGT AGCTTCCGAA TTTGACCGTT 2505 

AAATGGCACC GTATACC 2522 
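
The paired nucleotide and amino acid lines of SEQ ID NO: 11 can be cross-checked by translating the coding sequence. The following is a minimal sketch, assuming the standard genetic code; the names BASES, AMINO, CODON_TABLE and translate are illustrative and are not part of the listing. It translates the first 14 codons of the coding sequence (positions 310 to 351) into one-letter codes.

```python
# Standard genetic code in compact form, with bases ordered T, C, A, G
# at each codon position (an assumption: the listing does not name a code).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence into one-letter amino acid codes."""
    cds = "".join(cds.split()).upper()  # drop the grouping spaces
    if len(cds) % 3:
        raise ValueError("coding sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Codons typed from the listing (SEQ ID NO: 11, positions 310 to 351);
# codons 2 and 8 are read as AGC (Ser) and TAC (Tyr).
first_codons = "ATG AGC CTC CAT TTC TTA TAC TAC TGC AGT GAA CCA ACC TTG"
peptide = translate(first_codons)
print(peptide)
```

The result can be compared by eye with the three-letter line printed under the codons (Met Ser Leu His Phe Leu Tyr Tyr Cys Ser Glu Pro Thr Leu).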

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Ser Leu His Phe Leu Tyr Tyr Cys Ser Glu Pro Thr Leu Asp Val
1 5 10 15

Lys Ile Ala Phe Cys Gln Gly Phe Asp Lys His Val Asp Val Ser Ser
20 25 30

Ile Ala Lys His Tyr Asn Met Ser Lys Ser Lys Val Asp Asn Gln Phe
35 40 45

Tyr Ser Val Glu Val Gly Asp Ser Thr Phe Thr Val Leu Lys Arg Tyr
50 55 60

Gln Asn Leu Lys Pro Ile Gly Ser Gly Ala Gln Gly Ile Val Cys Ala
65 70 75 80

Ala Tyr Asp Ala Val Leu Asp Arg Asn Val Ala Ile Lys Lys Leu Ser
85 90 95

Arg Pro Phe Gln Asn Gln Thr His Ala Lys Arg Ala Tyr Arg Glu Leu
100 105 110

Val Leu Met Lys Cys Val Asn His Lys Asn Ile Ile Ser Leu Leu Asn
115 120 125

Val Phe Thr Pro Gln Lys Thr Leu Glu Glu Phe Gln Asp Val Tyr Leu
130 135 140

Val Met Glu Leu Met Asp Ala Asn Leu Cys Gln Val Ile Gln Met Glu
145 150 155 160

Leu Asp His Glu Arg Met Ser Tyr Leu Leu Tyr Gln Met Leu Cys Gly
165 170 175

Ile Lys His Leu His Ser Ala Gly Ile Ile His Arg Asp Leu Lys Pro
180 185 190

Ser Asn Ile Val Val Lys Ser Asp Cys Thr Leu Lys Ile Leu Asp Phe
195 200 205

Gly Leu Ala Arg Thr Ala Gly Thr Ser Phe Met Met Thr Pro Tyr Val
210 215 220

Val Thr Arg Tyr Tyr Arg Ala Pro Glu Val Ile Leu Gly Met Gly Tyr
225 230 235 240

Lys Glu Asn Val Asp Ile Trp Ser Val Gly Cys Ile Met Gly Glu Met
245 250 255

Val Arg His Lys Ile Leu Phe Pro Gly Arg Ser Tyr Ile Asp Gln Trp
260 265 270

Asn Lys Val Ile Glu Gln Leu Gly Thr Pro Cys Pro Glu Phe Met Lys
275 280 285

Lys Leu Gln Pro Thr Val Arg Asn Tyr Val Glu Asn Arg Pro Lys Tyr
290 295 300

Ala Gly Leu Thr Phe Pro Lys Leu Phe Pro Asp Ser Leu Phe Pro Ala
305 310 315 320

Asp Ser Glu His Asn Lys Leu Lys Ala Ser Gln Ala Arg Asp Leu Leu
325 330 335

Ser Lys Met Leu Val Ile Asp Pro Val Lys Arg Ile Ser Val Asp Asp
340 345 350

Ala Leu Gln His Pro Tyr Ile Asn Val Trp Tyr Asp Pro Ala Glu Val
355 360 365

Glu Ala Pro Pro Pro Gln Ile Tyr Asp Lys Gln Leu Asp Glu Arg Glu
370 375 380

His Thr Ile Glu Glu Trp Lys Glu Leu Ile Tyr Lys Glu Val Met Asn
385 390 395 400

Ser Glu Glu Lys Thr Lys Asn Gly Val Val Lys Ser Gln Pro Ser Pro
405 410 415

Ser Ala Gln Val Gln Gln
420

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2347 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 615...1616

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

AGCAACTTTC CTGACCCAGA GGACCGGTAA CAAGTGGCCG GGAGCAACTT TTGCAAATCT 60 

CTTCTGCGCC TTAAGGCTGC CACCGAGACT GTAAAGAAAA GGAGAAGAGG AACCTATACT 120 

CATACCAGTT CGCACAGGCC TAAGTTGGGC GAGGCCTAGC CGCGGCTGCC TAGCGTCCCC 180 

CCCCCCCTCA CAGCGGAGGA GGGGACAGTT GTTGGAGGCC GGGCGGCAGA CCCGATCGCG 240 

GGCCTCCACC GAGAATTCCG TGACGACTGG TCAGCACCGC CGGAGAGCCG CTGTTGCTGG 300 

GACTGGTCTG CGGGCTCCAA GGAACCGCTG CTCCCCGAGA GCGCTCCGTG AGTGACCGCG 360 
ACTTTTCAAA GCTCGGCATC GCGCGGAGTC CTACCAACGT GAGTGCTAGC GGAGTCTTAA 420

CCCTGCGCTC CCTGGAGCGA ACTGGGGAGG AGGGCTCAGG GGGAAGCACT GCCGTCTGGA 480 

GCGCACGCTC TAAACAAACT TTGTTACAGA AGCAGGGACG CGCGGGTATC CCCCCGCTTC 540 

CCGGCGCGCT GTTGCGGCCC CGAAACTTCT GCGCACAGCC CAGGCTAACC CCGCGTGAAG 600 

TGACGGACCG TTCT ATG ACT GCA AAG ATG GAA ACG ACC TTC TAC GAC GAT 650
Met Thr Ala Lys Met Glu Thr Thr Phe Tyr Asp Asp
1 5 10

GCC CTC AAC GCC TCG TTC CTC CAG TCC GAG AGC GGT GCC TAC GGC TAC 698
Ala Leu Asn Ala Ser Phe Leu Gln Ser Glu Ser Gly Ala Tyr Gly Tyr
15 20 25

AGT AAC CCT AAG ATC CTA AAA CAG AGC ATG ACC TTG AAC CTG GCC GAC 746
Ser Asn Pro Lys Ile Leu Lys Gln Ser Met Thr Leu Asn Leu Ala Asp
30 35 40

CCG GTG GGC AGT CTG AAG CCG CAC CTC CGC GCC AAG AAC TCG GAC CTT 794
Pro Val Gly Ser Leu Lys Pro His Leu Arg Ala Lys Asn Ser Asp Leu
45 50 55 60

CTC ACG TCG CCC GAC GTC GGG CTG CTC AAG CTG GCG TCG CCG GAG CTG 842
Leu Thr Ser Pro Asp Val Gly Leu Leu Lys Leu Ala Ser Pro Glu Leu
65 70 75

GAG CGC CTG ATC ATC CAG TCC AGC AAT GGG CAC ATC ACC ACT ACA CCG 890
Glu Arg Leu Ile Ile Gln Ser Ser Asn Gly His Ile Thr Thr Thr Pro
80 85 90

ACC CCC ACC CAG TTC TTG TGC CCC AAG AAC GTG ACC GAC GAG CAG GAG 938
Thr Pro Thr Gln Phe Leu Cys Pro Lys Asn Val Thr Asp Glu Gln Glu
95 100 105

GGC TTC GCC GAG GGC TTC GTG CGC GCC CTG GCT GAA CTG CAT AGC CAG 986
Gly Phe Ala Glu Gly Phe Val Arg Ala Leu Ala Glu Leu His Ser Gln
110 115 120

AAC ACG CTT CCC AGT GTC ACC TCC GCG GCA CAG CCG GTC AGC GGG GCG 1034
Asn Thr Leu Pro Ser Val Thr Ser Ala Ala Gln Pro Val Ser Gly Ala
125 130 135 140

GGC ATG GTG GCT CCC GCG GTG GCC TCA GTA GCA GGC GCT GGC GGC GGT 1082
Gly Met Val Ala Pro Ala Val Ala Ser Val Ala Gly Ala Gly Gly Gly
145 150 155

GGT GGC TAC AGC GCC AGC CTG CAC AGT GAG CCT CCG GTC TAC GCC AAC 1130
Gly Gly Tyr Ser Ala Ser Leu His Ser Glu Pro Pro Val Tyr Ala Asn
160 165 170

CTC AGC AAC TTC AAC CCG GGT GCG CTG AGC AGC GGC GGT GGG GCG CCC 1178
Leu Ser Asn Phe Asn Pro Gly Ala Leu Ser Ser Gly Gly Gly Ala Pro
175 180 185

TCC TAT GGC GCG GCC GGG CTG GCC TTT CCC TCG CAG CCG CAG CAG CAG 1226
Ser Tyr Gly Ala Ala Gly Leu Ala Phe Pro Ser Gln Pro Gln Gln Gln
190 195 200

CAG CAG CCG CCT CAG CCG CCG CAC CAC TTG CCC CAA CAG ATC CCG GTG 1274
Gln Gln Pro Pro Gln Pro Pro His His Leu Pro Gln Gln Ile Pro Val
205 210 215 220

CAG CAC CCG CGG CTG CAA GCC CTG AAG GAA GAG CCG CAG ACC GTG CCG 1322
Gln His Pro Arg Leu Gln Ala Leu Lys Glu Glu Pro Gln Thr Val Pro
225 230 235

GAG ATG CCG GGA GAG ACG CCG CCC CTG TCC CCT ATC GAC ATG GAG TCT 1370
Glu Met Pro Gly Glu Thr Pro Pro Leu Ser Pro Ile Asp Met Glu Ser
240 245 250

CAG GAG CGG ATC AAG GCA GAG AGG AAG CGC ATG AGG AAC CGC ATT GCC 1418
Gln Glu Arg Ile Lys Ala Glu Arg Lys Arg Met Arg Asn Arg Ile Ala
255 260 265

GCC TCC AAG TGC CGG AAA AGG AAG CTG GAG CGG ATC GCT CGG CTA GAG 1466
Ala Ser Lys Cys Arg Lys Arg Lys Leu Glu Arg Ile Ala Arg Leu Glu
270 275 280

GAA AAA GTG AAA ACC TTG AAA GCG CAA AAC TCC GAG CTG GCA TCC ACG 1514
Glu Lys Val Lys Thr Leu Lys Ala Gln Asn Ser Glu Leu Ala Ser Thr
285 290 295 300

GCC AAC ATG CTC AGG GAA CAG GTG GCA CAG CTT AAG CAG AAA GTC ATG 1562
Ala Asn Met Leu Arg Glu Gln Val Ala Gln Leu Lys Gln Lys Val Met
305 310 315

AAC CAC GTT AAC AGT GGG TGC CAA CTC ATG CTA ACG CAG CAG TTG CAA 1610
Asn His Val Asn Ser Gly Cys Gln Leu Met Leu Thr Gln Gln Leu Gln
320 325 330

ACG TTT TGAGAACAGA CTGTCAGGGC TGAGGGGCAA TGGAAGAAAA AAAATAACAG 1666
Thr Phe

AGACAAACTT GAGAACTTGA CTGGTTGCGA CAGAGAAAAA AAAAGTGTCC GAGTACTGAA 1726

GCCAAGGGTA CACAAGATGG ACTGGGTTCG GACTGACGGC GCCCCCAGTG TGCTCTGGAG 1786

TGGGAAGGAC GTGGCGCGCC TGGCTTTGGC GTGGAGCCAG AGAGCAGGCC TATTGGCCGG 1846

CAGACTTTGC GGAGCGCTGT GCCGCGCGCG ACCAGAACGA TGGACTTTTC GTTAACATTG 1906

ACCAAGAACT GCATGGACCT AACATTCGAT CTCATTCAGT ATTAAAGGGG GGTGGGAGGG 1966

GTTACAAACT GCAATAGAGA CTGTAGATTG CTTCTGTAGT GCTCCTTAAC ACAAAGCAGG 2026

GAGGGCTGGG AAGGGGGGGA GGCTTGTAAG TGCCAGGCTA GACTGCAGAT GAACTCCCCT 2086

GGCCTGCCTC TCTCAACTGT GTATGTACAT ATATATTTTT TTTTAATTTG ATGAAAGCTG 2146

ATTACTGTCA ATAAACAGCT TCCTGCCTTT GTAAGTTATT CCATGTTTGT TTGTTTGGGT 2206

GTCCTGCCCA GTGTTTGTAA ATAAGAGATT TGAAGCATTC TGAGTTTACC ATTTGTAATA 2266

AAGTATATAA TTTTTTTATG TTTTGTTTCT GAAAATTTCC AGAAAGGATA TTTAAGAAAA 2326

TACAATAAAC TATTGAAAAG T 2347

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 334 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Met Thr Ala Lys Met Glu Thr Thr Phe Tyr Asp Asp Ala Leu Asn Ala
1 5 10 15

Ser Phe Leu Gln Ser Glu Ser Gly Ala Tyr Gly Tyr Ser Asn Pro Lys
20 25 30

Ile Leu Lys Gln Ser Met Thr Leu Asn Leu Ala Asp Pro Val Gly Ser
35 40 45

Leu Lys Pro His Leu Arg Ala Lys Asn Ser Asp Leu Leu Thr Ser Pro
50 55 60

Asp Val Gly Leu Leu Lys Leu Ala Ser Pro Glu Leu Glu Arg Leu Ile
65 70 75 80

Ile Gln Ser Ser Asn Gly His Ile Thr Thr Thr Pro Thr Pro Thr Gln
85 90 95

Phe Leu Cys Pro Lys Asn Val Thr Asp Glu Gln Glu Gly Phe Ala Glu
100 105 110

Gly Phe Val Arg Ala Leu Ala Glu Leu His Ser Gln Asn Thr Leu Pro
115 120 125

Ser Val Thr Ser Ala Ala Gln Pro Val Ser Gly Ala Gly Met Val Ala
130 135 140

Pro Ala Val Ala Ser Val Ala Gly Ala Gly Gly Gly Gly Gly Tyr Ser
145 150 155 160

Ala Ser Leu His Ser Glu Pro Pro Val Tyr Ala Asn Leu Ser Asn Phe
165 170 175

Asn Pro Gly Ala Leu Ser Ser Gly Gly Gly Ala Pro Ser Tyr Gly Ala
180 185 190

Ala Gly Leu Ala Phe Pro Ser Gln Pro Gln Gln Gln Gln Gln Pro Pro
195 200 205

Gln Pro Pro His His Leu Pro Gln Gln Ile Pro Val Gln His Pro Arg
210 215 220

Leu Gln Ala Leu Lys Glu Glu Pro Gln Thr Val Pro Glu Met Pro Gly
225 230 235 240

Glu Thr Pro Pro Leu Ser Pro Ile Asp Met Glu Ser Gln Glu Arg Ile
245 250 255

Lys Ala Glu Arg Lys Arg Met Arg Asn Arg Ile Ala Ala Ser Lys Cys
260 265 270

Arg Lys Arg Lys Leu Glu Arg Ile Ala Arg Leu Glu Glu Lys Val Lys
275 280 285

Thr Leu Lys Ala Gln Asn Ser Glu Leu Ala Ser Thr Ala Asn Met Leu
290 295 300

Arg Glu Gln Val Ala Gln Leu Lys Gln Lys Val Met Asn His Val Asn
305 310 315 320

Ser Gly Cys Gln Leu Met Leu Thr Gln Gln Leu Gln Thr Phe
325 330
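
The coding sequence locations and the protein lengths stated in this listing can be checked against one another. The following is a minimal sketch, assuming each LOCATION range is inclusive at both ends and does not include the stop codon (an assumption, although it agrees with the three pairs below); the function name codon_count and the table ANNOTATED are illustrative and not part of the listing.

```python
def codon_count(start: int, end: int) -> int:
    """Number of codons in an inclusive coding range start...end."""
    length = end - start + 1
    if length % 3:
        raise ValueError(f"range {start}...{end} is not a whole number of codons")
    return length // 3


# (CDS start, CDS end, stated protein length), copied from the feature
# tables of SEQ ID NOS: 9, 11 and 13 and the lengths of SEQ ID NOS: 10, 12 and 14.
ANNOTATED = {
    "SEQ ID NO: 9 / 10": (364, 1641, 426),
    "SEQ ID NO: 11 / 12": (310, 1575, 422),
    "SEQ ID NO: 13 / 14": (615, 1616, 334),
}

results = {
    name: codon_count(start, end) == residues
    for name, (start, end, residues) in ANNOTATED.items()
}
for name, ok in results.items():
    print(name, "consistent" if ok else "MISMATCH")
```

A mismatch here would point to a transcription error in either the LOCATION field or the LENGTH field rather than in the sequence itself.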

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3967 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ATACCAGAGA CTCAAAAAAA AAAAAAAAGT TCCAGATTGC TGGACAATGA CCCGGGTCTC 60 

ATCCCTTGAC CCTGGGAACC GGGTCCACAT TGAATCAGGT GCGAATGTTC GCTCGCCTTC 120 

TCTGCCTTTC CCGCCTCCCC TCCCCCGGCC GCGGCCCCGG TTCCCCCCCT GCGCTGCACC 180 

CTCAGAGTTG GCTGCAGCCG GCGAGCTGTT CCCGTCAATC CCTCCCTCCT TTACACAGGA 240 

TGTCCATATT AGGACATCTG CGTCAGCAGG TTTCCACGGC CGGTCCCTGT TGTTCTGGGG 300 

GGGGGACCAT CTCCGAAATC CTACACGCGG AAGGTCTAGG AGACCCCCTA AGATCCCAAA 360 

TGTGAACACT CATAGGTGAA AGATGTATGC CAAGACGGGG GTTGAAAGCC TGGGGCGTAG 420 

AGTTGACGAC AGAGCGCCCG CAGAGGGCCT TGGGGCGCGC TTCCCCCCCC TTCCAGTTCC 480 

GCCCAGTGAC GTAGGAAGTC CATCCATTCA CAGCGCTTCT ATAAAGGCGC CAGCTGAGGC 540 

GCCTACTACT CCAACCGCGA CTGCAGCGAG CAACTGAGAA GACTGGATAG AGCCGGCGGT 600 
TCCGCGAACG AGCAGTGACC GCGCTCCCAC CCAGCTCTGC TCTGCAGCTC CCACCAGTGT 660

CTACCCCTGG ACCCCTTGCC GGGCTTTCCC CAAACTTCGA CCATGATGTT CTCGGGTTTC 720 

AACGCCGACT ACGAGGCGTC ATCCTCCCGC TGCAGTAGGG CCTCCCCGGC CGGGGACAGC 780 

CTTTCCTACT ACCATTCCCC AGCCGACTCC TTCTCCAGCA TGGGCTCTCC TGTCAACAGA 840 

GAGGTGAGTT TGGCTTTGTG TAGCCGCCAG GTCCGCGCTG AGGGTCGCCG TGGAGGAGAC 900 

ACTGGGGTGT GACTCGCAGG GGCGGGGGGG TCTTCCTTTT TCGCTCTGGA GGGAGACTGG 960 

CGCGGTCAGA GCAGCCTTAG CCTGGGAACC CAGGACTTGT CTGAGCGCGT GCACACTTGT 1020 

CATAGTAAGA CTTAGTGACC CCTTCCCGCG CGGCAGGTTT ATTCTGAGTG GCCTGCCTGC 1080 

ATTCTTCTCT CGGCCGACTT GTTTCTGAGA TCAGCCGGGG CCAACAAGTC TCGAGCAAAG 1140 

AGTCGCTAAC TAGAGTTTGG GAGGCGGCAA ACCGCGGCAA TCCCCCCTCC CGGGGCAGCC 1200 

TGGAGCAGGG AGGAGGGAGG AGGGAGGAGG GTGCTGCGGG CGGGTGTGTA AGGCAGTTTC 1260 

ATTGATAAAA AGCGAGTTCA TTCTGGAGAC TCCGGAGCAG CGCCTGCGTC AGCGCAGACG 1320 

TCAGGGATAT TTATAACAAA CCCCCTTTCG AGCGAGTGAT GCCGAAGGGA TAAOGGGAAC 1380 

GCAGCAGTAG GATGGAGGAG AAAGGCTGCG CTGGGGAATT CAAGGGAGGA TATTGGGAGA 1440 

GCTTTTATCT CCGATGAGGT GCATACAGGA AGACATAAGC AGTCTCTGAC CGGAATGCTT 1500 

CTCTCTCCCT GCTTCATGCG ACACTAGGGC CACTTGCTCC ACCTGTGTCT GGAACCTCCT 1560 

CGCTCACCTC CGCTTTCCTC TTTTTGTTTT GTTTCAGGAC TTTTGCGCAG ATCTGTCCGT 1620 

CTCTAGTGCC AACTTTATCC CCACGGTGAC AGCCATCTCC ACCAGCCCAG ACCTGCAGTG 1680 

GCTGGTGCAG CCCACTCTGG TCTCCTCCGT GGCCCCATCG CAGACCAGAG CGCCCCATCC 1740 

TTACGGACTC CCCACCCAGT CTGCTGGGGC TTACGCCAGA GCGGGAATGG TGAAGACCGT 1800

GTCAGGAGGC AGAGCGCAGA GGATCGGCAG AAGGGGCAAA GTAGAGGAGG TGAGCAGCGA 1860 

TTCTGGACCT TTGTGGGCTG GGGGGGGGGG GGGGGGCGGA GACTGAOGCA CAGACCACAC 1920 

AACAGAGAAG GGACGCTACT GACTGCACTT CCTGACCAGG AGCTGTGGCT GCTAGCCCTT 1980 

TCCCTCCCTT GTCAGATTTT GACAGTTGGA CCCAAGACAA ACTCTAGACA GTTTCCCTGA 2040 

CAGCTTCCTA CTTCATTCTC TAGCCGGGGA GCTTCTTTGT TCCCCTGCTA AAGATCTCAC 2100 

TTTAAATGCA AATCACACTC TGCCTGCCAA CTGCAGGTTA GAAAAACTGC TTCACCGAGA 2160 

GGTGCGGGTG CTGTAGGAGC CAGTTTCACT GGGGTGACTG AATGGAGGTG ACACTAGACA 2220 

ACCTTAACTG AATGTTGGTC CTTTTCTTCT ATAGCTATCT CCTGAAGAGG AAGAGAAACG 2280 

GAGAATCCGA AGGGAACGGA ATAAGATGGC TGCAGCCAAG TGCCGGAATC GGAGGAGGGA 2340

GCTGAGAGAT ACACTCCAAG CGGTAGGTTG AACCAGCTGC TGCTCCTGAA ACTTTATTAA 2400 

AGTTGGAGCT TGGGACTATG GGOGCAGGGT CCTTGAGCAT GCCCGTGTCT TATGCTTTCT 2460 

TATATCTCTC CCTATGCAGG AGACAGATCA ACTTGAAGAT GAGAAGTCTG CGTTGCAGAC 2520 

TGAGATTGCC AATCTGCTGA AAGAGAAGGA AAAACTGGAG TTTATTTTGG CAGCCCACCG 2580 

ACCTGCCTGC AAGATCCCCG ATGACCTTGG CTTCCCAGAG GAGATGTCTG TGGCCTCCCT 2640 

GGATTTGACT GGAGGTCTGC CTGAGGCTTC CACCCCAGAG TCTGAGGAGG CCTTCACCCT 2700 

GCCCCTTCTC AACGACCCTG AGCCCAAGCC ATCCTTGGAG CCAGTCAAGA GCATCAGCAA 2760 

CGTGGAGCTG AAGGCAGAAC CCTTTGATGA CTTCTTGTTT CCGGCATCAT CTAGGCCCAG 2820

TGGCTCAGAG ACCTCCCGCT CTGTGCCAGA TGTGGACCTG TCCGGTTCCT TCTATGCAGC 2880 

AGACTGGGAG CCTCTGCACA GCAATTCCTT GGGGATGGGG CCCATGGTCA CAGAGCTGGA 2940 

GCCCCTGTGT ACTCCCGTGG TCACCTGTAC TCCGGGCTGC ACTACTTACA CGTCTTCCTT 3000 

TGTCTTCACC TACCCTGAAG CTGACTCCTT CCCAAGCTGT GCCGCTGCCC ACCGAAAGGG 3060 

CAGCAGCAGC AACGAGCCCT CCTCCGACTC CCTGAGCTCA CCCACGCTGC TGGCCCTGTG 3120 

AGCAGTCAGA GAAGGCAAGG CAGCCGGCAT CCAGACGTGC CACTGCCCGA GCTGGTGCAT 3180 

TAGAGAGAGG AGAAAGACGT CTTCCCTCGA AGGTTCCCGT CGACCTAGGG AGGACCTTAC 3240 

CTGTTCGTGA AACACACCAG GCTGTGGGCC TCAAGGACTT GCAAGCATCC ACATCTGGCC 3300 

TCCAGTCCTC ACCTCTTCCA GAGATGTAGC AAAAACAAAA CAAAACAAAA CAAAAAACCG 3360 

CATGGAGTGT GTTGTTCCTA GTGACACCTG AGAGCTGGTA GTTAGTAGAG CATGTGAGTC 3420 

AAGGCCTGGT CTGTGTCTCT TTTCTCTTTC TCCTTAGTTT TCTCATAGCA CTAACTAATC 3480 

TGTTGGGTTC ATTATTGGAA TTAACCTGGT GCTGGATTGT ATCTAGTGCA GCTGATTTTA 3540 

ACAATACCTA CTGTGTTCCT GGCAATAGCG TGTTCCAATT AGAAACGACC AATATTAAAC 3600 

TAAGAAAAGA TAGGACTTTA TTTTCCAGTA GATAGAAATC AATAGCTATA TCCATGTACT 3660 

GTAGTCCTTC AGCGTCAATG TTCATTGTCA TGTTACTGAT CATGCATTGT CGAGGTGGTC 3720 

TGAATGTTCT GACATTAACA GTTTTCCATG AAAACGTTTT TATTGTGTTT TCAATTTATT 3780

TATTAAGATG GATTCTCAGA TATTTATATT TTTATTTTAT TTTTTTCTAC CCTGAGGTCT 3840 

TTCGACATGT GGAAAGTGAA TTTGAATGAA AAATTTTAAG CATTGTTTGC TTATTGTTCC 3900 

AGGACATTGT CAATAAAAGC ATTTAAGTTG AATGCGACCA CCTTCTTGCT CTCTTTATTC 3960 

TCAGTTT 3967 
(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



Met Met Phe Ser Gly Phe Asn Ala Asp Tyr Glu Ala Ser Ser Ser Arg
1 5 10 15

Cys Ser Ser Ala Ser Pro Ala Gly Asp Ser Leu Ser Tyr Tyr His Ser
20 25 30

Pro Ala Asp Ser Phe Ser Ser Met Gly Ser Pro Val Asn Thr Gln Asp
35 40 45

Phe Cys Ala Asp Leu Ser Val Ser Ser Ala Asn Phe Ile Pro Thr Val
50 55 60

Thr Ala Ile Ser Thr Ser Pro Asp Leu Gln Trp Leu Val Gln Pro Thr
65 70 75 80

Leu Val Ser Ser Val Ala Pro Ser Gln Thr Arg Ala Pro His Pro Tyr
85 90 95

Gly Leu Pro Thr Gln Ser Ala Gly Ala Tyr Ala Arg Ala Gly Met Val
100 105 110

Lys Thr Val Ser Gly Gly Arg Ala Gln Ser Ile Gly Arg Arg Gly Lys
115 120 125

Val Glu Gln Leu Ser Pro Glu Glu Glu Glu Lys Arg Arg Ile Arg Arg
130 135 140

Glu Arg Asn Lys Met Ala Ala Ala Lys Cys Arg Asn Arg Arg Arg Glu
145 150 155 160

Leu Thr Asp Thr Leu Gln Ala Glu Thr Asp Gln Leu Glu Asp Glu Lys
165 170 175

Ser Ala Leu Gln Thr Glu Ile Ala Asn Leu Leu Lys Glu Lys Glu Lys
180 185 190

Leu Glu Phe Ile Leu Ala Ala His Arg Pro Ala Cys Lys Ile Pro Asp
195 200 205

Asp Leu Gly Phe Pro Glu Glu Met Ser Val Ala Ser Leu Asp Leu Thr
210 215 220

Gly Gly Leu Pro Glu Ala Ser Thr Pro Glu Ser Glu Glu Ala Phe Thr
225 230 235 240

Leu Pro Leu Leu Asn Asp Pro Glu Pro Lys Pro Ser Leu Glu Pro Val
245 250 255

Lys Ser Ile Ser Asn Val Glu Leu Lys Ala Glu Pro Phe Asp Asp Phe
260 265 270

Leu Phe Pro Ala Ser Ser Arg Pro Ser Gly Ser Glu Thr Ser Arg Ser
275 280 285

Val Pro Asp Val Asp Leu Ser Gly Ser Phe Tyr Ala Ala Asp Trp Glu
290 295 300

Pro Leu His Ser Asn Ser Leu Gly Met Gly Pro Met Val Thr Glu Leu
305 310 315 320

Glu Pro Leu Cys Thr Pro Val Val Thr Cys Thr Pro Gly Cys Thr Thr
325 330 335

Tyr Thr Ser Ser Phe Val Phe Thr Tyr Pro Glu Ala Asp Ser Phe Pro
340 345 350

Ser Cys Ala Ala Ala His Arg Lys Gly Ser Ser Ser Asn Glu Pro Ser
355 360 365

Ser Asp Ser Leu Ser Ser Pro Thr Leu Leu Ala Leu
370 375 380

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AAGAAATGGA GGCTCATAAA TACCACAGCT 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
ATTGGAAGAA GACCAAAGCA AGAGCAACTA 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TAAGTAAGTA GTGCTGTATG AATACAGACA 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TACTGGCAAT ATATTACAGA TGGGTTTATG 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
GTGTGCAGCT TATGATGCTA TTCTTGAA 

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CGCGTCACCA CATACGGAGT CATC 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CTGGAGGAGT TCCAAGATGT CTACT 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
TGGAAAGAGC TTGGGGAAGG TGAG 
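
The declared lengths of the short single-stranded sequences SEQ ID NOS: 17 to 24 can be checked by counting bases once the grouping spaces are removed. The following is a minimal sketch; the dictionary OLIGOS and the helper base_count are illustrative names, and the lengths and sequences are copied from the entries above.

```python
def base_count(sequence: str) -> int:
    """Count bases in a sequence printed in space-separated groups."""
    return len("".join(sequence.split()))


# SEQ ID NO -> (declared LENGTH in base pairs, sequence as printed)
OLIGOS = {
    17: (30, "AAGAAATGGA GGCTCATAAA TACCACAGCT"),
    18: (30, "ATTGGAAGAA GACCAAAGCA AGAGCAACTA"),
    19: (30, "TAAGTAAGTA GTGCTGTATG AATACAGACA"),
    20: (30, "TACTGGCAAT ATATTACAGA TGGGTTTATG"),
    21: (28, "GTGTGCAGCT TATGATGCTA TTCTTGAA"),
    22: (24, "CGCGTCACCA CATACGGAGT CATC"),
    23: (25, "CTGGAGGAGT TCCAAGATGT CTACT"),
    24: (24, "TGGAAAGAGC TTGGGGAAGG TGAG"),
}

# Collect any entry whose counted length differs from the declared one.
mismatches = [
    seq_id
    for seq_id, (declared, sequence) in OLIGOS.items()
    if base_count(sequence) != declared
]
print("mismatched entries:", mismatches)
```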